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0.  Abstract. 


/I 


'  The  error  correction  model  for  a  vector  valued  time  series  has  been 
proposed  and  applied  in  the  economic  literature  with  the  papers  by 
Sargan( 1964) .Davidson  et  al.(1978),  Hendry  and  von  Ungern-Sternberg(1981)  and 
has  been  given  a  formal  mathematical  treatment  by  Granger( 1983) .  He 
introduced  the  notion  of  cointegratedness  of  a  vector  process  and  showed  the 
relation  between  cointegration  and  error  correction  models. 

This  paper  defines  a  general  error  correction  model,  that  encompasses  the 
usual  error  correction  model  as  well  as  the  integral  correction  model  by 
allowing  a  finite  number  of  error  correction  terms  which  correspond  to  linear 
combinations  of  the  vector  process  that  are  integrated  of  different  order. 

It  is  shown  that  this  structure  is  inherent  in  the  model  if  it  is  given 
in  autoregressive  form  or  moving  average  form  by  exploiting  the  singularity  of 
the  matrix  function  that  defines  the  model. 

The  theory  is  applied  to  some  examples  discussed  by  Davidson ( 1983)  and 
Harvey (1982) . 


I..A 

»  "  *  *  ~ 4 

irr^ 


Key  words:  Co integration,  error  correction,  non-stationery  time  series. 


A!'_V,A  module. 


k’.  ’■  V.  VV>  ... 


1.  Introduction. 

Many  of  the  current  controversies  concerning  macroeconomic  policy  issues, 
particularly  monetary  questions,  seem  to  derive  from  different  views  of  the 
duration  and  importance  of  short-run  and  long-run  behavior  of  economic  agents. 
The  fact  that  these  controversies  have  to  be  resolved  in  the  empirical  arena, 
can  clearly  be  demonstrated  by  a  growing  number  of  empirical  applications  in 
which  the  dynamics  of  short-run  and  long-run  adjustment  processes  are  being 
modelled.  In  particular,  the  idea  of  incorporating  the  dynamic  adjustment  to 
long-run  steady-state  targets  in  the  form  of  error  correcting  mechanisms,  in 
an  autoregressive  model  for  transitory  short-iun  dynamic  behavior,  originally 
suggested  by  Sargan(1964)  and  further  developed  by  Davidson  et  al.(1978), 
Hendry  and  von  Ungern-Stemberg(1981) ,  Davidson  and  Hendry(1981) ,etc.  seems  to 
have  introduced  a  useful  approach  to  modelling  the  dynamics  of  economic 
behavior. 

The  error  correction  model  is  a  model  that  combines  the  autoregressive 
form  for  the  changes  in  y^  with  existing  economic  theory  as  expressed  in  the 
long-term  static  relation 
(.1.1)  Ay  +  Bz  =0 

or  in  the  steady-state  growth  relation 
(1.2)  Ay  +  B(z  +  dz)  =  0. 

An  example  of  an  error  correction  model  is  given  by 
(1.8)  A1(L)dy(.  +  B1(l.)dzt  +D(L)[Ayt_1  +  Bz^j  -  C(L)*t. 

Applications  of  this  type  of  models  usually  have  proved  very  successful 
in  terms  of  model  fit,  meaningful  estimates  of  parameters  of  interest, 

•ompass i  rig, etc.  However,  for  a  long  time  a  formal  mathematica  l  treatment 
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seemed  to  be  lacking.  Granger( 1983)  was  the  first  to  provide  such  a  basis  by 
introducing  the  concept  of  cointegratedness  between  time  series  and  relating 
that  to  the  concept  of  error  correcting  models.  The  idea  of  Granger,  see  also 
Granger  and  Weiss(1983)  and  Granger  and  Engle(1985),  is  to  start  with  a 
general  model  for  x^.  =  {y^.z^}  expressing  that  is  stationary,  i.e. 

(1.4)  Axt  =  C(LHt 

and  then  showing  that  certain  properties  of  the  matrix  function  C(L)  imply 
that,  the  components  of  are  cointegrated.  This  is  then  used  to  derive  an 
error  correction  model.  The  purpose  of  this  is  to  explain  combinations  of  the 
t’s  in  terms  of  deviations  in  long-term  relations  between  the  non— stationary 
components  of  the  vector  process  {x^},  and  to  identify  these  long-term 
relations  using  the  concept  of  cointegration. 

The  purpose  of  this  paper  is  to  discuss  the  equations  (1.1)-(1.4)  find 
some  related  concepts  from  a  mathematical  point  of  view,  find  their 
.interrelations,  and  provide  a  framework  in  which  their  formal  analysis  can  be 
justified. 

In  doing  so  the  concepts  are  clarified  and  generalised.  We  thus  end  up 
with  a  very  general  type  of  model  for  a  class  of  non-stationary  stochastic 
processes.  Not  all  these  models  correspond  to  interesting  economic  models  but 
their  structure  permits  a  simple  analysis  and  thus  helps  the  understanding  of 
the  interesting  examples  which  are  discussed  in  the  economic  literature. 

The  paper  is  now  organised  as  follows  : 

Section  2  discusses  the  basic  properties  of  some  nun-stationary 
processes.  In  particular  the  role  of  the  starting  values  and  their  influence 
on  the  process  is  discussed. 

Following  Granger ( 1 983)  Section  3  discusses  the  problem  of  determining 


the  long-range  relations  in  a  system  given  by  the  moving  average 
representation 

At  -  C(L)et, 

where  d  is  the  order  of  the  process. We  derive  a  general  form  of  the  error 
correction  model,  that  allows  error  correction  terms  of  different  order.  The 
usual  error  correction  models,  see  Davidson  et  al.(1978),  as  well  as  the 
integral  correction  models,  see  Hendry  and  von  Ungern  Sternberg(1981)  and 
Davidson ( 1983 ) ,  can  be  seen  as  special  cases  of  the  general  model.  Section  4 
discusses  formally  the  same  problem,  but  now  based  on  the  autoregressive 
representation 

A(L)xt  -  dPe-t, 

where  p  is  usually  zero. 

Conditions  for  this  model  to  be  interpreted  as  an  error  correction  model 
are  formulated,  and  the  order  of  the  process  is  found.  In  section  5  we 
discuss  the  special  situation  where  x^_  is  decomposed  into  endogeneous  and 
exogeneous  variables  and  derive  an  error  correction  model  for  the  targetting 
error. 

In  section  6  we  show  how  some  examples  from  the  economic  literature  can 
be  treated  by  the  general  methods  developed  in  the  previous  chapters,  and 
Section  7  contains  the  mathematical  results  which  consist  of  finding  a 
representation  of  the  determinant  of  a  matrix  valued  function  in  terms  of 
certain  indices  defined  by  the  null  spaces  of  succesive  derivatives  of  the 
function  at  L  ~  1 

2. _ Tiv ■  basic  propert  ies  of  a  class  of  non-stat. ionary  processes. 

It  is  customary  to  consider  non-st  at  ionary  processes  {x^_}  given  by  the 
c'ui  it  ion 


(2.1) 


21  *t  =  C(I.)et 


where  x^  and  e  R  ,  and  C(z)  is  an  axra  matrix’  valued  holomorphic  function 

given  by  its  power  series  with  radius  of  convergence  1  +  p,  p  >  0,  and  d  is  an 

integer.  Here  {t^}  is  a  sequence  of  independent  identically  distributed 

random  variables  with  mean  0  and  variance  matrix  T.  Note  that  the 

coefficients  of  C(z)  decrease  exponentially  fast,  which  shows  that  C(L)e^  is  a 

stationary  process.  It  is  easy  to  construct  x^.  recursively  starting  with  t  = 

0,  say.  This  is  done  as  follows:  We  sum  (2.1)  from  t  =  0  to  t  =  T  and  find 

T 

zld-1x  =  dd_1x_1  +  C(L)Z  e.  . 

1  1  t=0  Z 

Summing  again  gives 

dd_2x  =  (t+l)dd_1x  +  /-2x  +  C(L)  Z  e  . 

1  11  0<u<s<tU 

It  is  seen  that  the  process  x^.  will  be  composed  of  two  parts.  The  first  is  a 
polynomial  of  degree  d-1  with  coefficients  depending  on  the  past  values  of  x  , 
i.e.  for  t  <  0.  The  second  term  is  a  repeated  sum  of  the  e’s.  We  can  write 

xt  =  C(L)d~det  +  P^jU). 

Vc  call  _^(t)  the  completely  deterministic  part  of  x^  and  C(L)d  de^  the 
random  part  of  x^..  Note  that  A  ^  is  defined  as  a  finite  sum  from  0  to  t.  The 
completely  deterministic  part  can  be  considered  a  trend  in  the  system  showing 
the  influence  of  the  past,  whereas  the  random  part  contains  the  cumulative 
effect  of  the  shocks  to  the  system.  As  a  simple  example  consider  the  equation 

At  =  £t,  t  >  0. 

The  solution  is 

t 

X  =  X_.  +  (t+l)dx_1  +  I  £ 

1  1  1  0<s<u<ts 

Note  that  the  trend  part  is  just  a  straight  line  through  points  (-2,x_2)  and 
(  1.x  j}  and  that  conditionally  on  these  values  x^.  fluctuates  around  this 


— i 


-4  - 


Mr 


tr.  d  with  a  variance  given  by 

*  2 

V(x ,|x  ,,x  .,)  Z  (t-s+D^VU,). 

1  s  0 

the  variance  increases  to  infinity  with  t.  This  is  often  expressed  by 
£-.a;-:ng  that  the  process  has  infinite  variance.  A  stationary  process  can  be 
started  at  "minus  infinity"  but  a  non-stationary  process  must  be  started  at  a 
finite  time  point,  and  the  whole  process  has  to  be  considered  conditionally  on 
the  values  before  this  time,  otherwise  the  process  is  simply  not  defined  by 
the  equation  (2.1).  One  can  subtract  P^_^(t)  from  x^,  since  the  difference 
also  satisfies  the  differential  equation,  but  now  with  the  starting  values 
Zero. 

This  problem  has  implications  for  some  of  the  formal  calculations  often 
applied  to  time  series.  Consider  for  instance  the  equation 

4dxt  =  C(L)dbet  ,  t  >  0. 

If  ■,  e  sum  b  times  we  obtain 

=  C(L)et  +  Pb_x(t), 

whore  P,  .  (t)  has  coefficients  depending  on  the  values  of  (x  ,c  }  with  t  < 
b-1  £  t 

0 . Thus  one  can  cancel  <db,  at  the  expense  of  adding  a  trend  of  order  b-1.  One 
can  also  justify  the  cancellation  of  <db  as  an  operation  on  the  random  part  of 
the  process.  In  the  following  we  shall  in  some  examples  be  explicit  about  the 
tt  c  d,  but  the  later  examples  only  the  random  part  will  be  dealt  with  in 
d  -t  11. 

Next  we  turn  to  the  notion  of  cointegration,  see  Granger (1033) . 
Definition. 2. 1.  We  shall  call  x^  integrated  of  order  d  if  x^  has  the 
rep. nsentat i on 

At  =  C(I.)et  4  Pb(t) 

v;ii  •  1  P,  (t)  is  a  completely  determin  ist  ic  polynomial  of  decree  b,  and  C(l)  t 

u 


0.  In  other  words  x  is  integrated  of  order  d  if  d^x  is  stationary  apart 
from  a  completely  deterministic  component.  Notice  that  C(l)  r  0  implies  that 
A ^  ^x^  is  not  stationary. 

Definition  2.2.  Let  x^.  be  integrated  of  order  d.  We  shall  call  x^. 
cointegrated  with  cointegration  vector  a  «  if  of  order  s  if  a’xt  is  integrated 
of  order  d-s. 

Thus  the  order  of  x^  is  reduced  by  s  if  the  combination  a’x^  is 
considered. 

It  is  mathematically  convenient  to  allow  any  vector  a  in  the  definition 
of  cointegration.  Thus  if  a  =  (1,0,0)’  say,  then  oc'x^  =  .  We  thu3  express 

the  fact  that  some  components  of  x^  is  in  fact  integrated  of  lower  order  than 
the  whole  vector  process,  by  saying  that  a  certain  unit  vector  is  a 
reintegration  factor.  This  is  clearly  a  slight  abuse  of  the  idea  behind 
cointegration  but  it  makes  the  formulation  simpler. 


3.  The  error  correction  model  der i ved  from  the  moving  average 
representation. 

We  shall  consider  the  equation 
(3.1)  <ddxt  =  C(L)e.  ,  t  >  0, 

and  assume  that  C(z)  is  holomorphic  for  |z|  <  1+p  ,  and  is  non-singular  for 
|z|  <  1+p  except  for  z  =  1,  where  we  assume  that  C(l)  is  singular  but  A  0, 
since  if  C(l)  were  0  then  a  similar  model  would  hold  with  d  replaced  by  d— 1. 
Mote  that  we  do  not  assume  that  each  af  the  components  of  x^  are  integrated  of 
the  same  order.  Such  an  assumption  is  not  necessary  for  the  results  developed 
below,  but  in  connection  with  the  examples  this  point  will  be  discussed  in 
more  detail.  We  want  to  derive  on  error  correction  model  for  x^.,  following 
the  ideas  of  Granger ( 1983) .  We  shall  first  give  a  general  definition  of  an 
error  correction  model  and  then  give  some  examples  before  we  prove  the  main 
result. 

The  ultimate  goal  of  this  investigation  is  to  be  able  to  find  the 
properties  of  a  vector  process  x^  from  the  defining  equation  in  the 
autoregressive  form 

A(L)xt  =  ,  t  ^  0. 

The;  equation,  which  defines  the  process  uniquely,  must  therefore  contain 
information  on  the  order  of  the  process  and  of  which  components  are 
co integrated.  The  problem  is  how  to  extract  this  information.  If  we  can  find 
out,  that  the  order  of  integration  is  2,  say,  then  we  can  write  the  equation 
in  the  form 

Vt  *  dVt  *  v^t  *  £t 

and  we  want  to  interpret  A^x^  and  dA^x^  as  stationary  error  correction  terms. 
Thus  in  order  to  interpret  this  as  an  error  correction  model  we  must  make  sure 


Z  zlVxj.  +  A(L)dx  =  f(L)At  ,t  >  0 


that  all  terms  represent  stationary  terms  and  that  they  have  not  been 
differenced  too  much.  This  is  made  precise  in 

Definition  3. 1  A  model  of  the  form 

k-1  .  , 

.k 

U.X.  f  A {.LJt 

it 

1--S 

is  called  a  general  error  correction  model  of  order  k  if 

(3.2)  A(z)  is  holomorphic  for  |z|  <  1+p  and  A(l)  #  0 

(3.3)  ^D.x  is  stationary,  i  =  -s,...,k-l 
(3. A) 

(3.5)  f(z)  t  0  ,  |z|  <  1+p. 


x^  is  integrated  of  order  k 


The  terms  dXD.Xj_,  -s  <  i  <  k  represent  error  correction  terms  with 

cointegration  factors  F.^  of  order  (>)  k-i,  and  the  term  A(L)^x^  gives  the 

k 

autoregressive  model  for  the  stationary  process  A  x^..  In  general  d  =  0  but  in 

some  cases  we  need  a  different  value.  Note  that  if  i  <  0  then  the  term  z^D.x, 

l  t 

i.*  no  integral  correction  term,  i.e.  the  Jij  fold  summation  from  0  to  t  of 

Vf 

Some  examples  will  be  given  below 

F.xample  3.  U  Consider  the  process  (c^>y^.)  given  by  the  equations 
dct  =  Myt  +  i(yH-ct_i)  +  eu 
Ayt  =  e2t 

Tiiis  example  has  been  adapted  from  example  6.1  and  is  treated  in  more  detail 
in  Section  6.  It  is  easily  seen  that  from  the  second  equation  it  follows, 
that  y^  is  integrated  of  order  1.  Now  the  first  equation  only  makes  sense  if 
( (  is  integrated  of  order  1  as  well ,  in  such  a  way  that  y  -c  is  integrated  of 
order  0,  since  if  for  instance  were  integrated  of  order  2,  then  the  left 
hard  side  would  he  of  order  1  and  the  right  hand  side  of  order  2.  Thus  y  and 
c  ere  reintegrated  with  cointegrnt ion  vector  a’  -  (1,1)  of  order  l,  and  tin* 


model  for  Ac^  is  given  in  part  by  the  autoregressive  term  /Sdy^  and  in  part  by 

the  stationary  error  correction  term  y^ 

Thus  one  can  in  a  simple  fashion  identify  the  order  of  the  vector  process 

(<:  y  )  as  well  as  the  autoregressive  part  and  error  correction  part  in  this 

2 

example.  Note  that  if  the  second  equation  is  replaced  by  A  y^  =  &  t  then  the 
analysis  changes  and  c^  and  become  integrated  of  order  2  arid  (y^-c^) 
becomes  integrated  of  order  1.  In  this  case  the  first  equation  should  be 
multiplied  by  A  before  one  can  identify  the  autoregressive  part  and  the  error 
correction  part,  and  then  the  error  correction  part  is  not  a  linear 
combination  of  the  components  of  the  process,  but  a  linear  combination  of  the 
components  of  the  differenced  process. 

I  i  pie  '<  .2.  A  modified  version  of  the  previous  example  is  given  by  the 


(  quat.  ion 


Ac 


ml 


t  r-  ^yt +  (yt-rct-i)  +  £ 


it 


^yt  =  e2t 


where  A  y.  -  Z  7.  .  A  full  treatement  of  a  similar  example  is  given  in 
1  s  0  S 

Section  6.  At  this  point  we  shall  use  it  to  indicate  that  it  may  not  be  so 
obvious  to  find  the  order  of  the  process  ai'd  find  out  in  what  sense 

these  equations  determine  an  integral  correction  model.  We  shall  only  note 
that  in  general  the  error  correction  terms  are  stationary  terms  which  are 
linear  combinations  of  the  vector  process  differenced  a  suitable  number  of 
t lines.  Finally  one  can  combine  the  two  examples  and  consider  an  equation  where 
both  the  error  correction  term  as  well  as  the  integral  correction  term  appear. 
Thus  in  general  one  can  have  many  error  correction  terns  in  an  error 
c-.rroction  model  corresponding  to  linear  combinations  which  are  integrated  of 


il  i  f  Cerent  orders . 


In  order  to  state  the  main  result  about  the  error  correction  model  we 


have  to  define  three  indices  determined  by  the  function  C(z).  The  necessary 
mathematical  results  are  given  in  Section  7,  but  here  we  shall  briefly 

recapitulate  the  definitions  and  results.  From  the  assumptions  on  C(z)  it 

00 

follows  that  we  can  expand  it  around  z  =  1  in  a  power  series  C(z)  -  Z  (l-z)JC  . 

j=0  j 

which  is  convergent  for  |z-l|  <  p.  The  results  of  this  section  are  formulated 

in  terms  of  the  coefficients  {C.}.  We  now  let  N.  -  {x  e  R™ |  x’C.  =  0  },  i.e. 

J  J  J 

the  null  space  for  C..  We  then  define  the  spaces  M.  =  If.  D. .  .fl  N .  of  vectors 

J  J  0  j 

which  are  null  vectors  for  all  matrices  C.,  i  =  Let  n.  denote  the 

i  J 

dimension  of  M..  Clearly  the  spaces  M  are  decreasing  and  since  C(z)  is 

assumed  to  be  regular  for  z  £  1  there  is  no  vector  x  which  is  contained  in 

i  ii  N..  Hence  there  exists  a  k  such  that 
J 

m  >  in.,  >  . . .  >  m,  ,  >  m,  =  m,  ,=...=  0. 

0  k-1  k  k+1 

00  k 

No  ■>  define  n  =  Z  in .  =  Z  m.  and  let  r  be  defined  by  det  C(z)  =  (l-z)rf(z) , 
j=0  J  j=0  J 

where  f(z)  £  0. 

Thus  we  associate  with  C(z)  the  three  numbers  (k,n,r)  which  will  be  used 

repeatedly  in  the  following.  We  define  C  (z)  by  the  relation 

n 

n~l 

C(z)  -  Z  (l-z)JC  +  ( l~z)nC  (z),  \z\  <  1+p, 
j=0  J 


and.  the  adjoint  C(z)  by 


C(z).  -  (  -l)1+Jdet  CU(z) 


wh  ce  C  (z)  it;  found  by  deleting  row  ,j  and  column  i  from  C(x).  Let 

n  1 

C(z)  r.  A  (1  z)JC  K 1  z)nc  (z)  ,  |z|  <  Up. 

,j(]  J  n 

Vf  <  eri  then  formulate 

Theorem  3.1.  The  process  x  given  bv  (3.1)  satisfies  an  autoregressive 


I 


model  of  the  form 


(3.5) 


2  Ad  .x.  +  C  (L)^dxt  =  f (L)<dr  V  ,  t  >  0. 
...  n-j  t  n  t  t 

J 


If  either  C  =  C  (1)  #  0  or  if  r  -  n  this  is  a  general  error  correction  model 
n  n 

of  order  d. 

Proof .  Theorem  7.3  shows  that  we  have  the  following  representation 

,  ,  k-1 

(3.6)  C(L)  =  /*  Cn_k(L)  =  k[  2  dJcn  k+j  +  zTC^D] 

J  d 


(3.7) 


C  .C(L)  =  A  C  .C.(L)  ,  j  -  1»«  *  •  >k 
n-j  n-j  j 


where  C  .(z)  and  C  (z)  tire  holomorphic  in  |z|  <  1+p.  Now  the  equation  (3.1) 
3  ^ 

defines  as  a  stationary  process,  hence  C(L)ddx^  is  well  defined  and 


stationary  and 


C(L)ddxt  =  C(L)C(I.)et  =  f(I.)<dret 


where,  by  Theorem  7.1,  f(z)  f  0,  |z|  <  1+p.  Now  use  the  representation  (3.6) 


and  we  get 


v;hich  implies  that 


n-1  , 

[  2  AJC  .  +  dnC  (L)]/x,  =  f(L)dre. 

,  j  n  t  t 

J=n-k 


(3.8) 


2  Ad  .x,  -h  C  (L)Adx^  =  f(L)dr  . 

n-j  t  n  t  t 


For  this  equation  to  be  an  error  correction  model  we  have  to  check  the 
conditions  (3.2)-(3.5).  We  know,  since  C(l)  £  0,  that  is  integrated  of 

order  d  which  shows  (3.4).  By  multiplying  (3.1)  by  .  and  using  (3.7)  we 


(3  0) 


V/  \  =  Vjcj(t)fet  ■  j  n  1 . k- 


O  I 


which  shows  that  A  is  stationary  and  hence  that  (3.3)  is  satisfied. 

Since  C  (z)  is  holomorphic  and  f(z)  ^0,  |z|  <  1+p  we  only  have  to  check  that 

n 

(1)  £  0.  Thus  if  C  =  C  (1)  i*  0  ,  then  (3.8)  is  an  error  correction  model, 
n  n  n  ’ 

and  if  instead  we  assume  that  r  =  n,  then  it  follows  from  Theorem  7.4 

k  -  I 

that  I  C  .C  .  has  rank  m.  Now  since  C-,  =  C(l)  #  0  we  have  M_  #  {0},  and 
j_0  n_J  J  0  0 

hence  there  exists  A  £  0,  A  e  Since  the  matrices  {C  C  ,  }  span 

0  n-1  n-k 

11^  we  find  that_ 

k  _  _ 

0#A’K  .C.  =  A’C  Crt 

j=0  n-j  j  n  0 

which  implies  that  f  0  and  the  Theorem  is  proved. 

The  first  term  in  the  error  correction  model  is 

.d-k~ 

A  C  ,  x 
n-k  t 

which  will  represent  an  integral  correction  term  if  d  <  k. 

It  is  seen  that  if  r  >  n,  then  a  difference  Ar  n  remains  at  the  e’s. 

This  has  the  effect  that  when  solving  for  x^.,  one  has  to  sum  the  error 

r~n 

correction  terms  as  well  as  the  term  A  The  last  one  will  contribute 

less  to  the  variance  of  the  process  than  the  first  one,  and  in  thi3  sense  the 
Main  contribution  to  the  variance  of  the  process  comes  from  the  error 
correction  terms.  Thus  one  has  in  fact  succeded  in  explaining  the  major  part 
of  tiie  e’s  in  terms  of  interpretable  error  correction  terms.  If  r  =  n  there 
is  a  balance  between  the  two  kinds  of  errors,  and  we  therefore  call  this  the 
ha  i. triced  case.  We  shall  see  in  the  examples  that  both  situations  can  occur 
evo  in  the  examples  that  have  been  token  from  the  econometric  literature.  We 


ill  also  see 


that,  one  can  always  reduce  the  unbalanced  case  to  the  balanced 


y  introducing  new  variables.  This  will  be  discussed  in  detail  in 


Section  7,  and  in  connection  with  the  examples  3.4,  6.1  and  6.2. 


Example  3.3.  Consider  the  system 


Tx  1  f  1  d  £  1  fe  1 

d2|x  1  J  1  -d  2d2  1  e  1  t  >  0 

[X3tJ  j  0  -2^1  o  J  |fc3t  J 

If  we  express  C(L)  as  a  function  of  d,  we  get 

ri  o  o']  [oio]  ro  o  1 1 

C(L)  =|l  0  0|  +  A  |0  -1  0  j  +  A2  |o  0  2  |  =  C  +  AC  +  A2 C  . 

to  0  0J  L0  -2  0  J  L0  0  0  J  u  1  * 

It  easily  follows  that 

r  n  roi  r  n 

n  =  sp{  | —1 | ; I o } >  ;  N.  n  N,  -  sp  | —1 | ;  n  n  N  n  N  =  {0} 

u  t  0J  tlj  u  1  L  lj 

and  hence  =  2,  =  1,  =  . . .  =0.  Thus  k=2,n=3  and  since  det  C (A)  = 

we  have  r  =  n  =  3.  The  adjoint  matrix  becomes 

f  4 A3  -2 A3  3 A3  1 

C(d)  =  |  0  0  -A2  | 

t  -2 A  2 A  -2 A  J 


TO  0  0]  ^  0  01  „f4  -2  31 

=A  [  1 0  0  0|+  A  |0  0  -1 1  +d‘ 1 0  0  0 1  ] 

L-2  2  -2j  lo  o  oj  to  0  0J 

=A  [  C1  +  AC2  +  d2C3] 

The  equation  now  becomes 

A3  [  Cl  +  dC2  +  d2C3]xt  =  2A3e-t. 

Now  cancel  dn  -  A3  and  find  the  terra  corresponding  to  d  =  2  which  gives  the 

autoregressive  part  of  the  model.  Then 

2-  2 
A  C3x^  -C^x^.  ~  ^C2Xt  +  2et  +a  +  ^>t  +  c^ 

where  a,b,and  c  are  determined  by  the  values  before  time  t.  =  0. 

Now  insert  the  three  matrices  C.  and  we  get 


.2  [  4xlt  2x2t+3x3t^ 


l 


0 

0 


r  o  i  r  o  i  r  e  i 

I  =  2|  0  I  +  d|  x  |  +  2|  e,ZT  |  +  P  (t) 

J  ^xltx2t+x3t  L  0  J  L  e3ti 


^f4xlt~2x2t+3x3t]  =  2tlt  +  S2(t) 

2^xlt_x2t+x3t )  =~  2e3t  +  Q2^ 
^x3t  =-  2e.2t  +  R2(t). 


or  equivalently 

(3.10) 

(3.11) 

(3.12) 

Note  that  the  equations  (3.11)  and  (3.12)  are  special  cases  of  (3.9) 
which  express  that  certain  linear  combinations  are  integrated  of  lower  order 
than  2.  From  (3.11)  we  see  that  xit~x2t+x3t  *s  s^ati°nary  apart  from  the 
trend.  Note  also  that  and  x^  are  integrated  of  order  2  and  i® 

integrated  of  order  1,  but  the  variable  that  makes  x^_x2t  integrated  of  order 
1  is  the  same  as  that  in  -x^,  hence  xit"x2t+x3t  *s  re<iuce<i  to  stationarity. 
Thus  (3.11)  is  an  example  of  a  cointegration  relation  where  all  the  components 
are  not  necessarily  integrated  of  the  same  order.  Note  that  the  example  has 
n  =  r  =  3  and  in  this  case  the  rows  of  span  whereas  the  rows  of  and 

C‘2  span  Mq  ,  see  Theorem  7.4. 

Example  3.4.  Let  x^  be  given  by  the  equations 

.pa]  _  f 1  ♦ J  -d  lhd 

U2tJ  L  a  0  J  U2tJ 
r  i  oi  r  i  -ii  _  r  o  oi  r «  n 

C(L)  =|  |  +  A  |  |  ;  C(L)  =  |  I  +  A\  | 

L  0  0J  L  1  0J  L  o  lj  L-1  1J 

and  det  C(L)  =  A 2.  We  find  Nq  =  spj^j  un0  Nj  =  sp  ,  hence  Mq  =  and 
-  ,  giving  the  indices  k  =  1,  =  1,  m^=0,  and  hence  n  =  1,  whereas  r  = 

2.  Thus  we  are  in  the  unbalanced  case  but  C  =  C,  £  0  and  we  can  apply  Theorem 

n  1 


3.1.  In  the  equation 


C(L)dxt  =  A e 


wc  can  cancel  A  -A  and  the  resulting  equation  is  a  general  error  correction 
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model.  The  autoregressive  part  is  then  given  by  the  terms  involving  2Lx^.  We 
then  get 


r  o  n  =  _r  o  oi  + 

L-i  lj  t  [  o  ijxt 

which  shows  that  is  stationary  and  hence  that  (0,1)  is  a  cointegration 
vector  of  order  1.  The  last  equation  is 

^x2t_xlt^  =  _x2t  +  ^&2t’ 

where  the  left  hand  side  is  the  autoregressive  part  of  the  equation  and  on  the 


rigth  hand  side  x^  represents  the  error  correction  term  and  tg  the  shocks. 
If  we  solve  this  for  x2t~xlt  t^ien  maj°r  contribution  to  the  variance  will 


be  the  term  >  since  V(  2  x2s>  increa3e3»  whereas  V(  2  )  is  constant  in 

s=0  s=0 

t.  Thus  the  case  r  >  n  can  be  interpreted  as  the  case  where  the  major 
contribution  to  the  variance  is  given  by  the  error  corection  terms,  whereas 
the  shocks  only  play  a  minor  role.  Another  way  of  expressing  this  is  that  we 
have  combined  the  major  part  of  the  shocks  into  an  expression  which  we  can 
give  an  interpretation,  namely  the  error  correction  term.  The  case  r  >  n, 
however,  can  be  reduced  to  r  =  n  by  transforming  the  variables  into  new 
variables  which  are  linear  combinations  of  variables  of  the  same  order  of 
integration,  namely 


ylt  =  xlt 

y2t  =  +  x2f 

The  choice  of  these  new  variables  is  based  on  an  analysis  of  Ihe  matrices  (C.) 
such  that  the  variables  are  linear  combinations  of  suitably  differenced 
components  of  x^.  This  is  discussed  in  more  detail  in  Section  7. 

The  equation  for  the  new  variables  is  found  by  multiplying  by  the  matrix 

r  i  oi 

i-d  ij 


-  1  s  - 


and  we  find 


jy.ti  r  i  o  u  i+A  vii  [i  i  r  -dirw.i 

i  ji  i  oju‘tVi-/  /ji4‘j 

The  relevant  matrix  functions  are  now  given  by 


cal  -  r 1  oi  r  i  -n  +/  r  o  oi 

C(L)  L  o  oj  ai  o  oj  A  L-i  1J 


cm  -  r  0  oi  r  o  n  2[  l  oi 
cm  L  0  +a^  0  Xj  +a  ^  1  0J 


and  det  C(L)  =  A?.  In  this  case  =  spj^j  »  =  sp[lJ  *  ^2  =  SPjoj  * 

hence  =  Nq,  -  Nq  fl  but  =  {0}.  Thus  =  1,  m^  =  1,  =  0 

which  shows  that  k  =  2  and  n  =  r  =  2.  The  error  correction  model  looks  like 

r  o  oi  -i  +  r  o  n  +  ri  oi  = 

l  o  ij  yt  [o  ijyt  Li  oj^t  t. 

The  first  two  terms  represent  error  corrections  and  show  that  A  ond 

are  stationary,  whereas  y^  satisfies  the  autoregressive  model 

^ylt  =  £lt  ~  y2t 
dylt  =  e2t"y2t~d  y2t* 

These  equations  are  clearly  equivalent  in  view  of  the  representation  of  y  as 
n  function  of  The  first  analysis  of  this  example  showed  that  x^^.  is  in 

fact  stationary.  A  fact  which  is  obvious  from  the  defining  equation,  since 
one  can  cancel  a  A  in  the  equation  for  *2t‘  ^  Pr°bal>ly  a  good  idea  to 

start  out  with  variables  which  are  integrated  of  the  same  order.  The  second 
analysis  offers  a  more  interesting  point,  since  we  find  that  not  only  is  y^  = 
X2t  “^lt  stationary-  but  *n  integrated  of  order  -1,  which  shows  that 

x^-d  is  stationary.  We  have  thus  found  a  cointegrnt.ion  relation  between 

two  variables  of  the  same  order  ,  x^  and  A 
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4.  The  error  correction  model  derived  from  the  autoregressive 
repres entation 


Let  the  process  {x^,t  2:  0}  be  given  by  the  equation 
(4.1)  A(L)xt  -  <dPe-t  ,  t  >  0, 

where  A(z)  is  holomorphic  for  |z|  <  1+p  and  non-singular  for  z  £  1  ,  but  A(l) 

A  0.  We  define  the  coefficients  {A^}  by  expanding  A(z)  around  z  =  1,  A(z) 

00 

-  2  (1 ~z)  A.  ,  |z|  <  p.  We  want  to  interpret  the  equation  (4.1)  as  an  error 
i=0  1 

correction  model,  see  Definition  3.1.  For  this  it  turns  out  that  we  need  to 
calculate  the  numbers  (k’.r’.n’)  for  the  transposed  matrix  function  A(z)*. 

Theorem  4.1.  The  process  x  given  by  (4.1)  is  integrated  of  order  less 
than  or  equal  to  r'-n’+k’-p  and  in  the  expansion 


(4.2)  AV  n  ~PA(L)x. 


k’-l 
=  I  A 
j~-0 


r  -n  +j-p 


A  .x. 
J  t 


+  A^, (1) A 


r’-n’+k'-p 


xt  =f(L)dr  n  et 


all  terras  are  stationary.  If  either  A  ,  ,  ,  A  0  or  r*  =  n’  then  (4.2)  is  an 

n  k 

error  correction  model  of  order  r’-n'+k’-p. 

Proof .  It  follows  easily  from  the  definition  of  the  adjoint  matrix,  that 


(A’(z))  =  (A(z))’  and  hence  that  we  have  from  (7.4) 

A(I.)  =  dn’_k’An,_k,(L) 

and  from  (7.6) 

(4.3)  A.A  ,  (L)  =  dk’_JA.A  ,  .(I.)  ,  j  =  l,...,k. 

J  n  -k  j  n  j 

Now  multiply  (4.1)  by  A(L),  then  we  get 

f(I,M'xt  -  d"'  k’fpAn,_k,(L)e.t 


which  shows  that. 


is  stationary.  If  An , , ( 1 )  £  0  then  x^  is  integrated  of  order  r’-n'+k’-p. 

If  r*  =  n’,  then,  by  Theorem  7.4,  the  rows  of  A^,^,  span  which  i3 

non-empty  find  hence  again  A^,  ^,  *  0  such  that  x^.  is  integrated  of  order  k’-p 
which  proves  (3.4). 

Now  multiply  (4.4)  by  A.  and  we  get  from  (4.3)  that  the  first  terms  of 

J 

the  expansion  vanish,  and  that 


r-n’+k’  Pf(L)A  =  4k’  JA  A  .(L)e 
j  t  j  n’-jv  '  t 


which  shows  that 


.r’-n’+j-p 


A  .x  ,  j  —  1,  —  ,k 

J  ^ 


is  stationary  which  proves  (3.3).  Since  A^,  =  A^,(l)  £  0  ,  by  the  definition 
of  k’ ,  it  follows  that  we  have  an  error  correction  model.  In  the  case  when  r* 


>  n’  one  may  get  A  ,  ,  ,  =  0  in  which  case  x  will  be  integrated  of  lower  order 
n  K  u 

and  one  may  have  to  cancel  some  more  powers  of  A  before  the  model  can  be 


interpreted  as  an  error  correction  model,  but  the  condition  that  *  0 

ensures  that  no  power  can  be  cancelled  and  that  the  results  hold. 

Example  4.1.  Consider  the  equations 

Xlt  “  X2t  +  ^2t  =  elt 
(4.5)  ^xlt  ~  x2t^  "  e2t' 

It  is  clear  from  the  second  equation,  that  xu--x2t  integrated  of  order  1  , 
and  hence  it  follows  from  the  first  equation  that  is  integrated  of  order 

1  and  hence  that  x^^.  is  of  order  2.  Thus  all  terms  are  not  stationary,  which 
means  that  (4.5)  can  not  directly  be  interpreted  as  an  error  correction  model 
in  the  sense  of  Definition  3.1.  We  shall  give  a  formal  analysis  as  follows: 
We  find 


A’  ( I.) 


r  i  di 

[~l+A  -A\ 


r  i 

H 


oi 

0J 


+  A 


r  o 

i  i 


n 

-n 
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giving  k’=  l>mg,=  l>m2,=  0  and  n*  =  1,  while  r’  =  2.  The  expansion  of 
Ar  n  ^A(L)xt  now  reduces  to  multiplying  through  by  A  in  the  equation 


defining  x^,  and  we  get 


^(xlt~x2t^  +  *  X2t  = 


A 


it 


A  X2t  Ae2t 


Theorem  4.1  now  gives  the  order  of  x^  is  less  than  or  equal  to  r’-n’+k’-p 

=  2.  Clearly  r*  >  n’,  but  it  is  easy  to  see  that  An,_j_,  =  A^  £  0,  which  shows 
that  the  order  in  this  case  is  equal  to  2  and  that  (4.5)  has  to  be  multiplied 
by  A  to  become  an  error  correction  model,  and  that  the  error  correction  term 
becomes  ^(x.^-Xg^).  Thus  in  order  to  interprete  the  equation  as  an  error 
correction  model  one  first  has  to  multiply  by  A. 

5.  Gran ger  causality. 

We  shall  consider  the  special  case  of  (3.1)  where  x  =  (y  ,z.)’  and  C(L) 
and  are  partitioned  accordingly 

dr  y.i  r  pu)  QU)ir»ui 

B(L,JU“j 

It  is  easily  seen  that  det  C(L)  =  det  P(L)  det  R(L)  and  that 


T  P( L)  det  R(L)  -P(L)Q(t)R(L)l 
C(L)  =  |  _  | 

L  0  R(L)  det  P(L) J 

Apart  from  the  assumptions  on  C(z)  stated  in  section  3  we  shall  assume  that 
R(z)  is  non-singular  and  that  P(l)  A  0.  Then  one  finds 

I«irlAt  =  e2t 

and 

(5.1)  ^ld(yt  -  Q(L)H(L)_1zt)  =  P( L)&lfc. 

One  can  define  It( L)  =  <)(I,)R(I.)  ^  and  let  7T(1)  =  fl  be  the  impact  of  on  y  . 

Vie  shall  call  JT(L)z^  the  revealed  target  and  y^  -  7T( L) the  target  error,  see 


Kloek( 1983) .  We  let  (k,n,r)  denote  the  indices  for  the  matrix  function  P(z) , 


and  get  from  Theorem  3.1,  that  if  r  =  n  or  P  ^  0  then  we  have  the  error 

n 

correction  model 

k  ,  , 

(5.2)  2  da_JP  (y  -  ff(L)z  )  +  P  (L)dd(y  -  ff(L)z  )  =  f(L )Ar  nc 

.  .  n-j  t  t  n  t  t  it 

J=1 

We  shall  interprete  this  equation  as  follows:  From  (5.1)  it  follows  that  the 
target  error  is  integrated  of  order  d,  since  P(l)  #  0.  Hence  y^  can  be  tracked 
by  the  target  ZT(L)z such  that  the  difference,  the  target  error,  becomes 
integrated  of  order  d.  The  error  correction  terms  in  (5.1)  signify  that 
certain  linear  combinations  of  the  variable  y^  can  be  tracked  closer,  in  the 
sense  that  these  linear  combinations  of  the  target  errors  are  integrated  of 
lower  order,  or  in  other  words  the  target  error  is  cointegrated. 

Davidson( 1983)  compares  the  dynamic  target  IT ( L)z^,  relevant  for  a 
steady-state  growth  world,  with  the  static  target  Itz^  relevant  for  a  static 
equ librium  world.  He  then  calls  H(L)  trend  neutral  of  order  m  if  7T(L)tJ  =  ZftJ 
for  j  =  0,1,..., m,  and  derives  the  restrictions  to  be  placed  on  the  structural 
parameters  of  the  equation  system  for  this  to  hold. 

Let  us  next  assume  that  the  equations  are  given  in  the  autoregressive 


form: 


f  F(L)  G(L)iry,l  -.PTe,.! 

L  0  H(L)J |z£j  L&2 t J 

;ume  that  H(z)  is  non-singular  ,  and  that  F(l)  £  0. 


The  first  equation  is 

F(L)yt  +  G(L)zt  -  dPeu 

which  can  be  written 

(5.3)  F(L)(ytrd'rf(L)“1F(L)G(L)zt)  =  dPelt. 

Here  (k’jU’.r’)  are  the  indices  for  the  matrix  function  F(L)‘.  Hence  the 


revealed  target  is 


ff(L)  =  -A  r’f(L)  1F(L)G(L)zt 

where  JT( L)  can  be  replaced  by  the  first  terms  in  the  expansion 

p-1  . 

ff(L)  =  2  AJII .  +  AVn  (L) 
j=-r*  J  P 


since  H  (L)A^z.  is  stationary.  Now  in  case  r’  =  n’  or  F  ,  ,  ,  £  0  the  equation 
p  t  n  ~K 

(5.3)  gives  rise  to  an  error  correction  model  for  the  target  error  of  the  form 

k’-l  ,  -  * 

r’-n’+j-p. 


I  A 
j=0 


?Fjtyt"ff(L)zt]  +  Fk-(L)fyt_zr(L)2t1  =  f(L)<dlr  n  elt 


of  order  r'-n’+k’-p.  A  simple  example  of  this  is  given  in  Section  6. 


'T’W.'-"-.-'-.  *  ' 


6.  [examples.  Consider  the  model  proposed  by  Hendry  and  von 
Ungern-Sternberg  (J981)  and  discussed  by  Davidson  (1983). 

Exaniple  6._1  Let  (c^.y^.l^)  denote  the  logarithm  of  consumption, 
disposable  income,  and  personal  sector  liquid  assets  respectively.  The  model 
takes  the  form 
(6.1) 

(6.2) 


=  *  ’n(ye-r  Vi>  +  7i2(J't-r1t-x)  -  ‘u 


alt  ’21lyt-l  Ct-lJ  +  t2f 
To  complete  the  system  we  shall  add  a  third  equation  explaining  how  is 

generated  by  Then  the  equations  will  have  the  form  discussed  in  Section 

5  ,  and  we  can  solve  for  c^  and  1^  in  terms  of  y^  and  Let 

f(L)  -  dijj  +  d2(l— r^j)  -  ■»  12*r23  (1~^)2* 

We  shall  assume  that  f(z)  9=  0  for  z  £  1.  Then  we  find 

[  a  *  ’nl  ’121  1  N  ,  f  ^  +  ’ll1  +  712L  ]y  ♦  f*ul 

(  t  r  I  ^  I  .  I 


721L 


J  UtJ  L 


721L 


J 


*-£2f* 


and  hence 


(6.3)  | 


rcti  -1 

I  =  f  (L) 


r  a 

i 

-7  l  i f  r  pa* 

III 

711 

L  -y.nl 

A  +  *ni.  J[L 

721 

”L  +  !,t  +f‘nll 

J  l‘2tJJ 


The  first  term  on  the  rigth  hand  side  is  the  target  2T(L)yt  and  the  equation 
(6.3)  is  the  target  relation.  Now  different  models  for  y^  will  give  different 
behaviour  of  c^  and  1^.  We  shall  consider  two  cases 
Case  1 : 


Ay, 


g  +  e 


3t 


which  specifies  a  random  walk  with  drift  for  y^ ,  and 
Case  2: 


Av. 


g  +  A  c 


3t 


«  '  Z"2s- 
s-0 


both  cas<»r.  has  A  y^  stationary  with  zero  j:u'an,  and  since  a  target  relation  is 


p;i.ven  up  to  stationary  terms  wo  find  that  (6.3)  reduces  to 


((5.4) 


yfc  -  dy  /?  +  stationary  terms 


(6.5)  1  -  yt  i-  /lyt((/5-l)-»2|  4  1n^72l”,12  +  stationary  terms. 

1  f  we  now  take  expectations  given  the  negative  past  we  get 

E(ct)  =  E(yt)  -  2/^21 

E  ( 1 1 )  =  E(yfc)  +  ((/5-l)721  +  y  21*  ±2 

which  are  the  equations  one  would  get  by  formally  letting  4y  =  g  be 
non-stochastic  and  equating  the  e’s  to  zero,  see  Davidson(1983) . 

Thus  both  cases  give  rise  to  the  same  long-term  relations,  but  it  is  seen 
that  in  case  1,  we  have  that  dy^  -  g  is  stationary,  and  hence  y^_  is  integrated 
of  order  1  and  therefore  the  same  holds  for  c  and  1  .  Thus  (6.1)  and  (6.2) 
form  an  error  correction  model  with  an  autoregressive  representation  of  dc^, 

<dl t ,  and  dy^,  where  c^  and  1^  are  cointegrated  with  y^  such  that  c^-y^  and 

1.  are  stationary  and  enter  the  equations  with  suitable  coefficients. 

2 

In  case  2,  however,  d  y^  is  stationary  which  implies  that  c^.  and  1^  are 

integrated  of  order  2.  Thus  by  differencing  (6.1)  and  (6.2)  we  get  an 

2  2 

autoregressive  representation  of  d  c^.  and  d  1^..  Note  that  in  this  case 

and  e  enter  only  in  the  differenced  form,  which  shows  that  the  main 

2 

contribution  to  the  variance  of  c^  and  1  comes  from  d*~y^  and  the  error 

correction  terms  d(  1  -y  )  and  d(c^-y^_).  Thus  the  interpretation  of  (6.1)  and 

(6.2)  as  an  error  correction  model  depends  on  the  model  specification  for  the 

oxogeneous  variable  y.. 

j  t 

We  shall  now  show  how  the  formal  procedures  developed  in  Sections  3-5  can 
b‘  applied  to  this  example.  If  we  write:  the  case  2  in  the  autoregressive  form 


(6.6) 


where  X,  ’  (<\  ,  1  ,y\  )  then 


A(U\  ■  f  t 


A(L) 


7llM(l“7ll)  712(1“^  -yirJ12+A(yU+y12  fl)  } 

'21 


70,(l-d)  A  -?.,,(!  A) 


21' 

0 


0  A  J 

which  by  the  analysis  of  Section  4  has  k’  =  1,  n*  =  1,  and  r*  =  2,  whereas  p  = 

0.  Hence  Theorem  4.1  shows  that  we  shall  multiply  (6.6)  by  A  p  =  A.  It 


is  easily  seen  that  A^,  A  0,  such  that  the  resulting  equation  gives  an 

error  correction  model  of  order  2,  since  r’-n’+k’-p  =  2.  We  then  obtain  the 
information  from  the  analysis  that  ^(c^-y^)  and  4(l^.-y^)  are  the  stationary 
error  correction  terras.  Note  that  r’  >  n’  such  that  we  are  in  the  unbalanced 

case. 

One  can  reduce  to  the  case  r’  =  n’  by  introducing  new  variables  which  are 
found  by  analysing  A(L)’,  see  Theorem  7.5,  and  the  comments  at  the  end  of 
Section  7.  It  turns  out  that  the  new  variables,  in  which  the  problem  becomes 
balanced  are 


Ult  =  ct  “  yt  +"V721 

u2t  =  Jt  -  yt  Myt(721(1"/5)-Tll)/712-r21 

U3t  =  yt 


In  terms  of  the  new  variables  the  equations  now  become 

7ilult  +  712U2t  f  ( 1  7li)^ult  ~  y12Au2t  -  {1~fi  ~y211)A\t  =  elt 
721uU  ■  721dUlt  *  Au2t  ‘  (/3  "  1  +  7ll72i1)7121)-dSt  =  £2t 

A  U3t  =  £3t 

and  lienee  we  get  an  error  correction  model  with  an  autoregressive  model  for 
explained  by  the  error  correction  terms  which  are  recognised  as  the 
target  errors  (6.4)  and  (6.5). 

If  we  modify  the  equation  (6.1)  to  give  an  integral  correction  equation 
we  get 

dot  :  PA y,  *  •'ll(yt.,-<q  ,)  *  1(yt  rct  -1}  +  "it 


(6.7) 


which  together  with  (6.2)  determines  a  system  of  equations  for  (c^,l^).  We 
solve  (6.7)  in  the  form 

ct  =  yt  +  (^2yt(/3~l)  +  de1(.)/(<?1L  +  +^2(1-»1]l))- 

2 

This  shows  that  if  d  is  stationary,  then  c^-y^  is  stationary,  and  hence 
is  integrated  of  order  2,  and  the  target  relation  is 

Cj.  =  y^  +  stationary  terms. 

The  similar  relation  for  1^  becomes 

1^  =  ( i_^)721^yt^ll^  +  ^  ^e2t  +  stationary  terms. 

The  formal  analysis  proceeds  as  follows:  We  multiply  through  by  d  to 
avoid  the  negative  power.  We  then  have  an  expression  of  the  form  (4.1)  with  d 


1  and 


A(L)  = 


I  J/1  A  /  »  A  \  1 


•^2 


-791d(i-d) 

d3 


We  find  k’  =  2,  n*  =  4,  and  r’  =  5,  but  some  calculation  shows  that  A  .  ,  ,  = 

n’-k 

A^  +  0.  Hence  the  process  (c^l^y^)  is  of  order  r’-n’+k’-p  =  2,  and  we  shall 

9  “'ll 9  "d 9  0 

multiply  through  by  d  =  d  in  order  that  the  equation  can  be 

interpreted  as  an  error  correction  model.  We  thu3  find  that  c^-y^  is  a 

stationary  error  correction  term  which  appears  with  coefficient  -<?j^  in  the 

2  .  2 

expression  for  d  c^.  and  in  the  form  ~^721^ct  yt^  in  *-he  equation  for  d  1^. 

Since  i ’  >  n’  we  can  introduce  a  new  variable,  and  an  investigation  of 
tn»'  columns  of  A(L),  or  the  rows  of  A(L)’,  will  show  that  the  new  variable  is 


ut  =  Ct  ‘  yt  ~a  yt(n  ])/°\\ 

together  with  1^  and  y  .  In  the  new  variables  the  matrix  A  becomes 

Pi*  4:'' ' M  <’n'4<l  ’n'lW-U/ajL  ] 

A(I.)  ••  I  79|d(L  d)  A1  7.  1  tl  l)/c/M  | 

I  ^ 1  "  :<  1 1  1 

00  A 


which  is  seen  to  have  k’  =  3,  n*  =  r’  =  5.  Hence  the  process  is 

integrated  of  order  r'-n'+k’-d  =  2,  and  we  shall  multiply  through  by  A  a 

-  A  Thus  we  shall  cancel  a  factor  A  again  and  we  find  that  the  error 

correction  terms  now  are  A  ^u^u^.du^,  and  dl^.  which  appear  in  the 

2  2 

autoregressive  model  for  A  y^.  Thus  we  find  that  =  c^_-  y^.  is 

in  fact  integrated  of  order  -1,  which  means  that  c^  can  be  tracked  extremely 

2 

we  LI  by  v  +  A  y  (j3-l)/c?^ ,  in  the  sense  that  the  error  does  not  accumulate, 

t 

i.e.  the  sum  of  the  target  errors  2  u  has  a  bounded  variance. 

s=0  S 

The  next  example  we  shall  consider  is  a  model  proposed  by  IIarvey(1982) 
for  a  stochasticaLLy  varying  trend. 

Example  6.2.  Let  the  variables  ,  and  x^  be  given  by  the 

equations 


nij.  +  co<t  +  e. 


It 


•*"t  =  \  +  e2t 
=  63t 
=  *4t. 

This  is  an  autoregressive  model,  and  we  find 


A(L)  = 


r  i  -i 

i  0  A 


0 
-1 

i  0  0  A 
[  0  0  0 

.3 


-a  ] 

0  I 
0  I 
4  J 


which  has  determinant  det  A(L)  -  A'  ,  and  hence  r*  =  3,  and  we  find  k’  =  1  and 
n’  -  2.  Thus  we  are  in  the  unbalanced  case,  but.  it  is  easily  seen,  that 


A  , _j  ,  -  Aj  £  0.  Hence  Theorem  4.1  shows  that,  the  process  is  integrated  of 

pi  — n  >  __p 

order  r’-n’hk’  p  -  2,  and  that  a  factor  of  A  -  A  is  missing  in  the 

equations  before  they  can  be  interpreted  as  an  error  correction  model.  Then 

2 

t lie*  term  involving  A  will  be  the  autoregressive  part  and  those  with  A  the 


-  2(i- 


error  correction  terms.  It  then  follows  that  d(y^.-  m^-ax^)  and  are  the 
stationary  error  correction  terms,  and  that  the  only  error  correction  equation 
where  anything  is  corrected  is 

A\  =  ^t  +  A*2f 

It  is  seen  that  the  main  contribution  to  the  variance  of  m^  is  due  to  the 
stationary  error  correction  term  Note  that  y  and  are  integrated  of 

order  2,  and  that  y^-m^.  is  integrated  of  order  1,  whereas  y^-ra^-ax^  is 
integrated  of  order  0..  Hence  we  have  an  example  where  three  variables  are 
involved  which  are  not  of  the  same  order. 

Since  r*  >  n*  a  differencing  was  needed.  It  can  be  avoided  by 
introducing  the  new  variable  /^.  The  problem  is  now  balanced,  and 

the  equations  become 


oxt  = 

elt 

ut 

e2t 

-  *du. 

=  . 

t 

3t 

dx. 

t 

4t 

In  this  case  k’  =  2,  n*  =  r*  =  3,  and  the  process  is  of  order  r’-n'+k’-p  =  2. 
Hence  the  imbalance  in  A  has  been  removed  by  the  change  of  variables  and  the 
new  equation  can  be  viewed  as  an  error  correction  model  , where  we  now  get  the 
information  that  y^-m^-ax^  and  are  the  stationary  error  correction  terms, 
whereas  is  the  error  correction  term  of  order  1.  The  relevant  error 
correction  model  now  becomes 


d  mt  =  A,l  +  e3t 

where  now  the  error  correction  term  contributes  less  than  the  shocks  to  the 
variation  of  y^. 

7.  Mathematical  results 


Consider  a  matrix  valued  function  C(z)  which  is  defined  in  an  open  disc 
D  =  {z  ;  |z|  <  1+p  }  in  the  complex  plane.  The  function  is  colled  holomorphic 
if  the  n’th  derivative  exists  for  all  n,  and  it  is  a  well  known  result  that 
the  Taylor  series  expansion  of  a  holomorphic  function  at  a  point  z  e  D 
converges  in  the  largest  open  disc  contained  in  D,  see  for  instance 
Thron(1953) . 

We  shall  investigate  the  function  C(z)  around  the  point  z  =  1,  and  we 

assume  that  the  matrix  C(z)  is  non-singular  for  z  A  1,  and  that  C(l)  is 

singular,  but  £  0.  We  define  the  coefficients  {C .  ;  j  =  0,1, . . .  }  by  the 

J 

expansion 

00 

C(z)  =  2  (l-z)JC  ,  |l-z|  <  p 
j=0  J 

We  shall  repeatedly  use  the  fact  that  if  C  (z)  is  defined  by 

n 

n-1 

C(z)  =  2  (l-z)JC.  +  ( l-z)nC  (z) 

j=0  J 

then  C^Cz)  is  a  holomorphic  function  in  D.  This  follows  since  the  functions 
n-1  '  -n 

C(z),  Z  (i-z)JC.,  and  (1-z)  are  holomorphic  in  D  as  long  as  z  #  1.  At  the 

j=0  J 

CO 

point  1,  however,  the  function  C  (z)  has  the  expansion  C  (z)  =  Z  (l-z)JC. 

n  n  j—Q  J+n 

|z-l|  <  p,  which  shows  that  C^(z)  is  also  holomorphic  at  the  point  1.  We 

want  to  give  a  representation  for  the  determinant  of  C(z)  and  for  C(z)  the 
adjoint  matrix  defined  by 

C.j(z)  =  {-l)1+^det  CJ1(z) 

where  CJ1(z)  is  obtained  from  C(z)  by  deleting  row  j  and  column  i.  Note  that 

C(z)  is  also  holomorphic  since  each  element  is  given  as  a  finite  sum  of  finite 
products  of  holomorphic  functions. 


f  ' 

k - 


L  l 
t  •  v« 


We  define  the  coefficients  {C .  ;  j  =  0,1, —  }  by  the  expansion  at  z  =  1 

J 

03 

C(z)  =  Z  (l-z)JC..  We  define  the  null  spaces 
j=0  J 

N.  =  {  x  e  j  x’C  .  =  0},  j  =  0,1,... 

0  J 


m.  =  i  n  n,  n  n  n.. 

J  0  1  J 

Then,  since  C(z)  is  regular,  there  is  no  vector  x  which  makes  all  C  .  zero, 

J 

hence  =  {0}  for  j  >  k,  say.  Note  that  must  be  non-zero,  and  that  * 

{0},  and  that  Cq  =  C(l)  £  0  implies  that  Mq  £  R0.  We  shall  now  define  the 

CO 

index  n  =  Z  m.,  where  m.  is  the  dimension  of  M..  The  basic  idea  is  that  if  x 
j=0  J  J  J 


e  f  1 . ,  then 
J 


x‘C(z)  =  (l-z)j+1  x’Cj+1(z), 


since  the  first  j+-l  terms  x’CQ, . . . ,x’C  .  are  zero.  This  corresponds  to  the 
idea  that  x  is  a  cointegration  factor  of  order  j+1,  and  we  shall  use  this  to 
evaluate  the  determinant. 

Theorem  7.1.  The  multiplicity  r  of  the  root  z  -  1  of  det  C(z)  is 
greater  than  or  equal  to  n,  hence  there  exists  a  function  f(z)  £  0,  such  that 

det  C(z)  =  (l-z)rf(z). 

Proof .  We  want  to  choose  a  convinient  coordinate  system  to  evaluate 

the  determinant  and  this  is  done  ns  follows:  From  the  relation 

Rra  =>  Mq  d  M1  d  . . .  d  Mk  =  {0} 
wo  get  an  orthogonal  decomposition  of  Fi™ 


-  V  -f  V  +  V 
0  l  k’ 


where 


v .  -  m .  ,  n  m  1  =  n„  n  ...  n  n.  .n  n  1 

J  J-l  J  o  j-1  J 

is  of  dimension  m.  ,-rn..  Note  that  C(l)  *  0  implies  that  V„  #  {0}  and  that 
J  l  J  V 


F 


t  j 


the  definition  of  k  implies  that  £  {0}. 

We  define  C(z)  by  x’C(z)  =  x’C.(z),  for  x  e  V.,  then 

J  J 

(7.1)  x’C(z)  =  (l-z)'^x’C(z)  ,  x  <=  V. 

J 

where  x*C(l)  t  0,  since  x  tf  M.«  Thus  V.  is  the  space  of  cointegration  factors 

J  J 

of  order  j. 

Now  choose  a  basis  {v^;  j  =  l,...,m  }  for  R1”,  such  that  the  vectors  given 

by  {v^;  j  =  m-iiu -1+1*  *  ‘  *  »Ta_miL  }  span  \h.  We  use  the  notation  =  m.  We 

define  the  order  of  v .  to  be  i(j),  thus  i(j)  =  i  if  v .  e  V..  Note  that 

0  J  l. 

U,  00  CO 

max  i ( j )  =  k  and  that  2  i(j)  =  2  i(m._,-m.)  =  2  m.  =  n.  From  (7.1)  we  find 
l<j<m  j=l  i=0  111  i=o  1 

v’C(z) v  -  C  (z)  —  (l-z)l(p)C  (z) 

p  q  pq  pq 

o; 

C(z)  =  diag{  (l-z)*^  ;p  =  l,...,m}  C(z) 
and  hence  that  det  C(z)  =  (l-z)ndet  C(z).  Now  let  r  be  the  multiplicity  of 
the  root  z  --  1  of  det  C(z),  then  r  >  n,  since  C(l)  may  be  singular.  This 
completes  the  proof  of  Theorem  7.1. 

Corollary  7.2  For  the  adjoint  matrix  we  have  the  result 

(7.2)  C.  j(z)  -  (-l)1+Jdet  CJ1(z)  =  (-l)1+J(l-z)rji  fJjL(z) 
where  f..(z)  £  0  and  r..  >  n-p  when  v.  e  V  . 

ji  ji  j  p 

Proof  Fly  deleting  the  row  j  with  v.  e  V  from  C(z)  we  leave  out  a 

J  P 

factor  (l~z)P  in  the  determinant  and  the  index  n  is  reduced  by  p. 

CO 

The  next  result  gives  a  representation  of  the  matrix  C(z)  -  2  (l-z)*C. 

j=0  J 

Theorem  7.3  The  coefficients  of  the  adjoint  matrix  satisfies 

(7.3)  C  .  -  0  ,  j  -  0,1,..., n-k-1 

J 


and  hence 


Further 


(7.5)  C  .C.  =  0  ,0  <  i  <  j  <  k 

n-j  i 

which  shows  that 

(7.6)  Cn_jC(z)  =  (l-z)JCn_jCj(z)  ,  j  =  1 . k. 

Proof  Let  us  consider  the  coordinate  system  (v.;  j  =  l,...,m  }  from 

J 

the  proof  of  Theorem  7.1,  and  express  C(z)  in  these  coordinates.  From  (7.2) 

we  find  that  r..>  n-p  when  v.  e  V  ,  but  we  have  n-p  >  n-k.  Thus  the  smallest 
Ji  J  P 

power  that  can  occur  in  the  expansion  of  C(z)  is  n-k,  which  shows  that  C.  =  0, 

J 

j  <  n-k,  and  this  proves  (7.3)  and  (7.4).  To  prove  (7.5)  and  (7.6)  we  will 
show  that 


(C  .)  (C.)  =  0  for  all  p,q,r  and  0  <  i  <  j  <  k. 

n-J  pq  l  qr 

We  then  get  (7.5)  by  summing  over  q.  Now  if  q  <  m-m.  ,,  then  v  e  V_  +  . . . 

J-l  q  0 

+  V  ^  and  >  n-(j-l).  Thus  the  smallest  power  in  the  expression  for 

C(z)  is  (l-z)n  ^  ^  which  shows  that  (C  .)  =  0. 

Pq  n-j  pq 

Similarly  if  q  >  m-m.,  i.e  v  e  V.  ,  +  ...+  V.  =  M.,  then  clearly  v  ’C.  = 

1  q  l+l  k  l  q  l 

0,  and  hence  (C.)  =0.  Now  if  we  take  i  <  j,  then  m-ra.  <  ra-m..,  which 

l  qr  l  J  i 

shows  that  all  q  values  were  considered,  and  this  completes  the  proof  of  the 
relation  (7.5)  and  (7.6). 

The  relation  (7.5)  shows  that  the  rows  of  C  .  are  contained  in  the  null 

n-j 

spaces  of  whenever  i  <  j,  and  in  particular  that  the  rows  of 


Cn-(i+l)’'",Cn-k 


are  contained  in  M. 

l 


Nj|  fl  ...  n  N.,  We  can  now  prove 


i  "  i  ■  i  ■  '  T*  9\v 


V  *1  ■  ■  ■  I  ■  w  n  ^1  mvwj'wi  m%  wy  -  |  »  v  ij  «ji  *y  m'V  "  ■  ■  ■  ■  1 


k  _ 

Theorem  7.4.  If  r  =  n  then  Z  C  _ .C .  is  proportional  to  the  identity  and 

j=0  n  J  J 


the  rows  of  C 
Proof. 

Theorem  7.1. 


.  C  .  span  M.  ,  i  =  0,1,..., k-1. 

n-i-1  n-k  l 

If  r  =  n  then  the  matrix  C ( 1 )  is  regular,  see  the  proof  of 
From  the  relation 


C(2)C(z)  =  det  C(z)  I 

mxm 

we  find  from  the  fact  that  Cn  =  . . .  =  C  ,  .  =  0  find  C  .C.  =  0  for  0  <  i  <  i 

0  n-k-1  n-j  l  J 

<  k,  that  the  first  possibly  non-zero  term  on  the  left  hand  side  is 

k  _ 

(l-z)n  ZC  .C. 
j=0  n'J  J 

k  _ 

and  if  det  C(z)  =  (l-z)nf(z),  f(z)  *  0,  then  Z  C  .C .  is  proportional  to  the 

j--0  J 

identity  and  hence  has  rank  m. 

Now  consider  the  terms  (C  .)  (C.)  .  From  the  proof  of  Theorem  7.3  it 

n-J  pq  J  qr 

follows  that 


and  that 


hence 


(C  .)  =  0  for  q  <  m-m .  , 

n-j'pq  M  j-1 


(C.)  =  0  for  q  >  m-m. 

J  qr  M  J 


(C  .C.) 
n-j  J  pr 


(C  .)  (C.) 

n~j  pq  j  qr 


where  the  summation  is  for  q  such  that  m-m.  .  <  q  <  m-m.  or  v 

j-i  j  q 


shows  that  the  rcink  of  C  C  .  is  less  than  or  equal  to  m.  ,-m. 

n-j  J  j-1  j 

evaluate  as  follows 


e  V..  This 
J 

We  then 


k  _  k  k 

m  -  rank(2  C  .C.)  <  Z  rank(C  ,C.)  <  Z  (n.  ,-ra.)  =  m. 
j=0  n_J  J  j=0  J  j-0  >~l  J 

It  follows  that  equality  holds  throughout  and  that 
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rarikC  .  >  rank(C  .C  .)  =  ra  .  ,-m.. 
n-J  n-j  j  j-1  j 

This  completes  the  proof,  since  then 

_  _  k 

m.  >  rank(C  .  .  )  >  1  (m.  ,-m.)  =  m. 

1  "-1-1  n'k  j=i+i  j  *  j 

which  shows  that  the  matrices  on  the  left  span  all  of  M.. 

If  r  >  n  we  do  not  get  so  complete  information,  but  we  shall  show  below 

how  the  case  r  >  n,  the  unbalanced  case,  can  be  reduced  to  the  case  r  =  n,  the 

balanced  case.  The  idea  is  that  instead  of  taking  only  linear  combinations  of 

the  x’s  we  allow  powers  of  A  in  the  coefficients.  We  can  then  prove,  that  by 

transforming  the  variables  we  can  increase  n  while  keeping  r  fixeu,  and  we 

thus  reduce  to  the  balanced  case  after  at  most  r-n  transformations.  A  similar 

idea  is  the  starting  point  for  the  work  of  Yoo(1985).  If  r  >  n  then  C(l)  is 

m 

singular  and  we  can  find  a  vector  a  =  2  a  v  e  R®  such  that  a’C(O)  =  0.  Let  s 

P=1  P  P 


he  the  largest  j  for  which  a.  A  0,  and  define  T  (z)  by 

J  Q 


v.’T  (z)  =  1-1 


J  a 


a  *  I  (1-z) 

s  i 

P=1 


i(s)-i(p) 


a  v 
P  P 


,  j  *  s 
,  j  =  s 


We  can  then  prove 

Theorem  7.5  The  matrix  function 

C  (z)  =  T  (z)C(z) 

a  cl 

has  indices  (r  ,n  ,k  ),  where  r  =  r,  n  >  n  +  1,  and  k  >  k. 

a  a  a  a  a  a 

Proof.  Since  det  T  (z)  =  1  we  clearly  have  r  =  r.  For  .i  A  s.  we  have 

v  ’C^(z)  =  v.’C(z),and  it  follows  that  the  order  of  v.  is  the  same  for  C  (z) 

J  ^  J  J  3 

tis  for  C(s).  If  j  =  s,  then 


a  v  ’C  (z)  =  Z  (l-z)l(s)  l(p)a  v  ’C(z) 
s  s  a  j  P  P 

=  Z  (l-z)i(s)~i(p)a  (l-z)i(p)v  ’C(z) 
P=1  P  p 
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=  (l-z)l(s)a’C(z). 

Hence,  since  a’C(l)  =  0,  we  get  that  the  order  of  vg  is  >  i(s)  +  1,  which 
implies  that  k  is  at  least  as  large  as  k,  whereas  n  is  greater  than  n. 

Q  £L 

Let  us  briefly  discuss  the  relevance  of  the  above  formulation  for  the 

theory  of  time  series.  The  reason  that  the  holomorphic  functions  play  a  role 

CO 

is  that  if  {z  ;  -°°  <  t  <  “  }  is  stationary,  and  if  B(z)  -  Z  z1B.  is 

i=0  1 

00 

£ 

holomorphic  for  |z|  <  1+p  then  the  process  y.  =  B(L)z,  =  Z  B.z.  .  is  a 

z  z  i=0  1  1 

stationary  process.  The  coefficients  decrease  exponentially  fast  in  i, 
which  shows  that  the  process  {y^.}  is  well  defined  and  it  is  easy  to  see  that 
it  is  stationary. 

We  have  throughout  considered  the  matrices  as  linear  transformations  of 
the  row  vectors,  using  the  notation  v’C(z).  This  comes  from  the  fact  that  in 
the  moving  average  model  (3.1)  the  change  of  variable  y^_  =  Txfc  gives  the 
relation 

Afc  =  TC(zHt. 

Thus  by  choosing  a  suitable  T  we  can  change  the  variables  to  find  a  convinient 
coordinate  system  in  which  to  calculate  the  determinant  or,  in  case  T  depends 
on  I,,  to  reduce  the  unbalanced  case  to  the  balanced  case. 

If  the  starting  point  of  the  investigation  is  the  autoregressive  model 
(4.1)  then  the  change  of  variable  y^  -  Tx^.  implies  the  equation 

A( LIT  ^yfc  -  [T’~]A(L)’]’yt  =  Afc. 

Thus  the  inverse  of  T’  operates  on  A(L)’.  Now  it  is  easy  to  see  that  from  the 

definition  of  the  adjoint,  it  follows  that  (A(L)’)  =  (A(L))’,  and  hence  that 
the  results  of  the  previous  Theorems  can  be  aplied  without  problems. 

Let  us  end  this  section  by  giving  explicitly  the  transformation  of  the 
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variables  that  increases  n  by  at  least  1. 


If 


£ \  =  C(L)fct 


and  a’C(l)  =  0,  then  we  can  choose  y.  =  T  (L)x.  as  follows 

tat 

r  xjt  ,  j  *  s 

yjt  _  J  -1®  i(s)-i(p) 

La  Z  A  ax,  ,  J  =  s  . 

sp_l  P  Pt 

If 


A(L)xt  =  A 

and  a’A(l)’  =  0,  then  one  can  introduce  the  variable  by 

r  -1  .i(s)-i(j)  .  , 

f  x ..  -  a  A  w  a  .x  ,  j  <  s 

jt  s  j  st 


Jt '  i  -jt 


J  2:  s. 


Examples  of  this  are  given  in  Section  6. 
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